Network Analysis with the Enron Email Corpus
Similar resources
Network Analysis with the Enron Email Corpus
We use the Enron email corpus to study relationships in a network by applying six different measures of centrality. Our results came out of an in-semester undergraduate research seminar. The Enron corpus is well suited to statistical analyses at all levels of undergraduate education. Through this article’s focus on centrality, students can explore the dependence of statistical models on initial...
Recommending Recipients in the Enron Email Corpus
Email is the most popular communication tool of the internet. In this paper we investigate how email systems can be enhanced to work as recipient recommendation systems, i.e., suggesting who recipients of a message might be, while the message is being composed, given its current contents and given its previously-specified recipients. This can be a valuable addition to email clients, particularl...
Annotating Subsets of the Enron Email Corpus
We present an annotation project for two subsets of the Enron email corpus. The first is a subset of the UC Berkeley Enron Email Analysis Project and the second consists of a portion of emails from the Voice Transcripts Email Correlated Corpora. Parts of the automatic content extraction (ACE) annotation guidelines, extended for the email domain are used for annotation. We also categorize the em...
Annotating the Enron Email Corpus with Number Senses
The Enron Email Corpus provides “Real World” text in the business email domain, which is a target domain for many speech and language applications. We present a section of this corpus annotated with number senses labelling each number as a date, time, year, telephone number etc. We show that sense categories and their frequencies are very different in this domain than in newswire text. The anno...
The Enron Corpus: A New Dataset for Email Classification Research
Automated classification of email messages into user-specific folders and information extraction from chronologically ordered email streams have become interesting areas in text learning research. However, the lack of large benchmark collections has been an obstacle for studying the problems and evaluating the solutions. In this paper, we introduce the Enron corpus as a new test bed. We analyze...
Journal
Journal title: Journal of Statistics Education
Year: 2015
ISSN: 1069-1898
DOI: 10.1080/10691898.2015.11889734